Lei Zhu

MaskedCLIP: Bridging the Masked and CLIP Space for Semi-Supervised Medical Vision-Language Pre-training

Jul 23, 2025
Via arXiv

The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs

Jul 10, 2025
Via arXiv

AdvMIM: Adversarial Masked Image Modeling for Semi-Supervised Medical Image Segmentation

Jun 25, 2025
Via arXiv

Surgery-R1: Advancing Surgical-VQLA with Reasoning Multimodal Large Language Model via Reinforcement Learning

Jun 24, 2025
Via arXiv

PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Jun 12, 2025
Via arXiv

Time-Lapse Video-Based Embryo Grading via Complementary Spatial-Temporal Pattern Mining

Jun 05, 2025
Via arXiv

PhotoArtAgent: Intelligent Photo Retouching with Language Model-Based Artist Agents

May 29, 2025
Via arXiv

Faster and Better LLMs via Latency-Aware Test-Time Scaling

May 26, 2025
Via arXiv

MoESD: Unveil Speculative Decoding's Potential for Accelerating Sparse MoE

May 26, 2025
Via arXiv

Semantic-enhanced Co-attention Prompt Learning for Non-overlapping Cross-Domain Recommendation

May 25, 2025
Via arXiv